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Remarks 

In the present response, no claims are amended or canceled. Claims 1-4 and 6-18 
are presented for examination. 

L Claim Rejections: 35 USC § 102 

Claims 1, 6, and 10 are rejected under 35 USC § 102 as being anticipated by an 
article entitled "Mercator: A scalable, extensive Web Crawler" by Heydon et al. 
(hereinafter Heydon). This rejection is traversed. 

A proper rejection of a claim under 35 U.S.C. §1 02 requites that a single prior art 
reference disclose each element of the claim. See MPEP § 2131, also, fV.L. Gore & 
Assoc., Inc. v. Oarlock, Inc., 721 F.2d 1540, 220 U.S.P.Q. 303, 313 (Fed. Cir, 19S3). 
Since Heydon neither teaches nor suggests each element in claims 1 , 6, and 1 0, these 
claims are allowable over Heydon, 

Response to Final Office Action 

In Applicant's response dated 1 5 July 2004, Applicant argues that Heydon does 
not teach or suggest a "plurality of web crawlers." The Final Office Action disagrees and 
states: 

Heydon's worker threads are each considered separate "web 
crawlers" as each thread performs the function of a ih web crawler." 
Nowhere has applicant provided a definition of "web crawler" that 
goes above and beyond the conventional definition, that would 
distinguish from Heydon's worker thread. (Final OA, pages 13- 
14). 

Applicant respectfully disagrees. Applicant uses the terms "web crawler" and 
"thread" in the plain meaning given to these terms per one of ordinary skill in the art (see 
MPEP 2111.01: Words of claim must be given their plain meaning). Webopedia (see 
www.webopedia.com) is an online dictionary dedicated to defining computer and internet 
related terms. Webopedia defines 4 Veb crawler" and "thread" as follows: 
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Web crawler (note: web crawler and spider are synonyms): 

A program that automatically fetches Web pages . Spiders are 
used to feed pages to search engines . It's called a spider because it 
crawls over the Web . Another term for these programs is 
webcrawler. 

Thread; 

(2) Tn programming , a part of a program that can execute 
independently of other parts. Operating systems that support 
multithreading enable programmers to design programs whose 
threaded parts can execute concurrently. 

Thus, a "web crawler" is a program that automatically fetches web pages. A 
"thread" is part of a program that can execute independently of other parts. Even 
Applicant's specification states: "A web crawler is a program that automatically finds and 
downloads documents from host computers in an Intranet or the world wide web" (see p. 
1, lines 24-25). 

Applicant respectfully asserts that the Office Action has not applied the terms 
"web crawler" and "thread" in accordance with their plain meaning. As noted above, the 
Office Action equates Heydon's threads" as being "web crawlers*' (see quote above of 
Final OA, pages 13-14: "Heydon's worker threads are each considered separate web 
crawlers. . ..")• The Office Action utilizes the terms "web crawler" and '"thread" in a 
manner that is repugnant to the plain meaning of these terms. 

Claim 1 . 

Independent claim 1 recites numerous limitations that are not taught or suggested 
in Heydon. For example, claim 1 recites "a plurality of web crawlers." By contrast, 
Heydon does not teach or suggest a plurality of web crawlers. Heydon teaches a single 
web crawler (see Abstract: 'This paper describes Mercator, a scalable, extensible web 
crawler ...."). Section 3.1 paragraph 1 does state: "Crawling is performed by multiple 
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worker threads." Multiple worker threads, though, are noL a plurality of web crawlers. In 
fact, Fig. 1 of Heydon teaches a single web crawler. 

As another example, claim 1 recites "assigning a web crawler identifier to each 
one of the plurality of web crawlers." Heydon does not teach or suggest this limitation. 
The Office Action cites Section 3.2, third paragraph of Heydon for teaching this 
limitation. This section of Heydon teaches that Mercators' URL frontier includes distinct 
FIFO subqueues; one FIFO subqueue per worker thread. This section further states: 
"Second, when a new URL is added, the FIFO subqueue in which it is placed is 
determined by the URL's canonical host name." No where does this section teach or 
suggest assigning an identifier to each one of a plurality of web crawlers. 

As another example, claim 1 recites: 

determining a web crawler identifier to which the 
representation corresponds; and 

when the determined web crawler identifier is not 
assigned to the respective web crawler, sending the 
identified address to the web crawler to which the 
determined web crawler identifier is assigned. 

Heydon does not teach or suggest these limitations. The Office Action repeatedly 
, cites Section 3,2, This section of Heydon teaches a data structure (URL frontier) that 

contains all the URLs that remain to be downloaded within a single web crawler. The 
claimed limitations in claim 1, though, arc not shown or suggested. 

Claim 6 

Independent claim 6 recites numerous limitations that are not taught or suggested 
in Heydon. For example, claim 6 recites "a plurality of web crawlers." By contrast, 
Heydon does not teach or suggest a plurality of web crawlers. Heydon teaches a single 
web crawler (see Abstract: 'This paper describes Mercator, a scalable, extensible web 
crawler . ...**). Section 3.1 paragraph 1 does state: "Crawling is performed by multiple 
worker threads/* Multiple worker threads, though, are not a plurality of web crawlers. In 
fact, Fig. I of Heydon teaches a single web crawler. 
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As another example, claim 6 recites "wherein each web crawler has been assigned 
a web crawler identifier." Heydon does not teach or suggest this limitation. The Office 
AcLion cites Section 3.2, third paragraph of Heydon for teaching this limitation. This 
section of Heydon teaches that Mercators' URL frontier includes distinct FIFO 
subqueues; one FIFO subqueue per worker thread. This section further states: "Second, 
when a new URL is added, the FIFO subqueue in which it is placed is determined by the 
URL's canonical host name." No where does this section teach or suggest a plurality of 
web crawlers with each web crawler assigned a web crawler identifier. 

As another example, claim 6 recites: 

for each respective web crawler: 

a main web crawler module . . . 

determining a web crawler identifier to which the 
representation corresponds; and 

when the determined web crawler identifier is not 
assigned to the respective web crawler, sending the 
identified address to a destination web crawler comprising 
the web crawler to which the determined web crawler 
identifier is assigned. 

Heydon does not teach or suggest these limitations. The Office Action repeatedly 
cites Section 3.2. This section of Heydon teaches a data structure (URL frontier) that 
contains all the URLs that remain to be downloaded within a single web crawler. The 
claimed limitations in claim 6, though, are not shown or suggested. 

Claim 10 

Independent claim 10 recites numerous limitations that are not taught or 
suggested in Heydon. For example, claim 10 recites: 

determining a web crawler identifier to which the 
representation corresponds; and 

when the determined web crawler identifier is not 
assigned to the respective web crawler, sending the 
identified address to a destination web crawler comprising 
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the web crawler to which the? determined web crawler 
identifier is assigned, 

Heydon does not teach or suggest these limitations. The Office Action repeatedly 
cites Section 3.2. This section of Heydon teaches a data structure (URL frontier) that 
contains all the URLs that remain to be downloaded within a single web crawler. The 
claimed limitations in claim 10, though, are not shown or suggested. 

IL Claim Rejections: 35 USC § 102(e) 

Claims 1-4 and 6-14 are rejected under 35 USC § 102(e) as being anticipated by 
Najork et aL (USPN 6,377,984, hereinafter Najork). This rejection is traversed. 

A proper rejection of a claim under 35 U.S.C § 1 02(e) requires that a single prior 
art reference disclose each element of the claim. See MPEP § 2131, also, W.L. Core & 
Assoc., Inc. v. Garlock, Inc., 721 F.2d 1540, 220 U.SJP.Q. 303, 313 (Fed. Cir. 1983). 
Since Najork neither teaches nor suggests each element in claims 1-4 and 6-14 y these 
claims are allowable over Najork. 

Response to Final Office Action 

In Applicant's response dated 15 July 2004, Applicant argues that Njork does not 
teach or suggest a "plurality of web crawlers/* The Final Office Action disagrees and 
states: 

The Examiner disagrees for the same reasons discussed above 
with regard to Heydon, Namely, each of Najork' s worker threads is 
considered equivalent to a 'Sveb crawler" as claimed. (Final OA, 
page 14). 

Applicant restates the argument above: The Office Action has not applied the 
terms 4t web crawler" and "thread" in accordance with their plain meaning. The Office 
Action equates Njork's '"threads" as being "web crawlers 4 * (see quote above of Final OA, 
page 14: '"Namely, each of Najork' s worker threads is considered equivalent to a '*web 



11 



PAGE 14/21 * RCVD AT 3/14/2005 1:37:03 PM [Eastern Standard Time] * SVfcUSPTO-EFXRMff 1 DNIS:8729306 * CSID:281 514 7137 * DURATION (miiKS):0M8 



MAR 14'05 12:41 FR HP LEGAL 



281 514 713? TO 917038729306 



P. 15 



crawler" as claimed"). The Office Action utilizes the terms "web crawler" and "thread" 
in a manner that is repugnant to the plain meaning of these terms. 

Claim 1 

Independent claim 1 recites numerous limitations that are not taught or suggested 
in Najork. For example, claim 1 recites "a plurality of web crawlers." By contrast, Najork 
does not teach or suggest a plurality of web crawlers. Najork teaches a single web 
crawler 

FIG. 1 shows an exemplary embodiment of a distributed 
computer system 100. The distributed computer system 100 
includes a web crawler 102 connected to a network 103 
through a network interconnection 110. (Col. 3, lines 60- 
63: emphasis added). 

Fig. 1 of Najork shows a web crawler 102 with memory 1 18 that includes threads 
130. Multiple threads, though, are not a plurality of web crawlers. In fact, Fig. 1 of 
Najork teaches a single web crawler. 

As another example, claim 1 recites "assigning a web crawler identifier to each 
one of the plurality of web crawlers." Najork does not teach or suggest this limitation. 
The Office Action cites identifier 'V (Figs, 2-4) to teach one of a plurality of web 
crawlers (each thread being a crawler, see also Fig. 3B), These figures and accompanying 
description do not leach or suggest assigning an identifier to each one of a plurality of 
web crawlers. By contrast, the figures arc generally directed to FIFO queues for a single 
web crawler. 

As another example, claim 1 recites: 

determining a web crawler identifier to which the 
representation corresponds; and 

when the determined web crawler identifier is not 
assigned to the respective web crawler, sending the 
identified address to the web crawler to which the 
determined web crawler identifier is assigned. 
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Najork does not teach or suggest these limitations. The Office Action cites Steps 
302-304, 508 and 306 7 510, 554. These steps in Najork are generally directed to FIFO 
queues for URLs downloaded within a single web crawler. The claimed limitations in 
claim 1, though, are not shown or suggested. 

Dependent claims 2-4 depend from claim 1 and thus inherit all the limitations of 
base claim 1 . As such, claims 2-4 are also allowable over Najork. Further, these 
dependent claims contain numerous limitations not taught or suggested in Najork. 

Claim 6 

Independent claim 6 recites numerous limitations that are not taught or suggested 
in Najork. For example, claim 6 recites "a plurality of web crawlers." By contrast, Najork 
does not teach or suggest a plurality of web crawlers. Najork teaches a single web 
crawler. 

FIG- 1 shows an exemplary embodiment of a distributed 
computer system 100. The distributed computer system 100 
includes a web crawler 102 connected to a network 103 
through a network interconnection 1 10, (Col. 3, lines 60- 
63: emphasis added). 

Fig. 1 of Najork shows a web crawler 1 02 with memory 1 1 S that includes threads 
130. Multiple threads, though, arc not a plurality of web crawlers. In fact, Fig. 1 of 
Najork teaches a single web crawler. 

As another example, claim 6 recites t4 whcrein each web crawler has been assigned 
a web crawler identifier." Najork does not teach or suggest this limitation. Najork does 
not teach or suggest this limitation. The Office Action cites identifier "r" (Figs. 2-4) to 
each one of a plurality of web crawlers (each thread being a crawler, see also Fig. 3B). 
These figures and accompanying description do not teach or suggest assigning an 
identifier to teach one of a plurality of web crawlers. By contrast, the figures are 
generally directed to FIFO queues for a single web crawler. 

As another example, claim 6 recites; 

for each respective web crawler: 
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a main web crawler module . 

determining a web crawler identifier to which the 
representation corresponds; and 

when the determined web crawler identifier is not 
assigned to the respective web crawler, sending the 
identified address to a destination web crawler comprising 
the web crawler to which the determined web crawler 
identifier is assigned. 

Najork does not leach or suggest these limitations. The Office Action cites Steps 
302-304, 508 and 306, 510, 554. These sieps in Najork are generally directed to FIFO 
queues for URLs downloaded within a single web crawler. The claimed limitations in 
claim 6, though, are not shown or suggested. 

Dependent claims 7-9 depend from claim 6 and thus inherit all the limitations of 
base claim 6. As such, claims 7-9 are also allowable over Najork. Further, these 
dependent claims contain numerous limitations not taught or suggested in Najork. 

Claim 10 

fndependent claim 1 0 recites numerous limitations that ore not taught or 
suggested in Najork. For example, claim 10 recites: 



determining a web crawler identifier to which the 
representation corresponds; and 

when the determined web crawler identifier is not 
assigned to the respective web crawler, sending the 
identified address to a destination web crawler comprising 
the web crawler to which the determined web crawler 
identifier is assigned. 

Najork does not teach or suggest these limitations. The Office Action cites Steps 
302-304, 508 and 306, 5 10, 554. These steps in Najork arc generally directed to FIFO 
queues for URLs downloaded within a single web crawler. The claimed limitations in 
claim 10 are not shown or suggested. 
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Dependent claims 1 1-14 depend from claim 10 and thus inherit all the limitations 
of base claim 10. As such, claims 1 1-14 are also allowable over Najork. Further, these 
dependent claims contain numerous limitations not taught or suggested in Najork. 

III. Claim Rejections: 35 USC § 102(f) 

Claims 1-4 and 6-14 are rejected under 35 USC § 102(f) because applicant did not 
invent the claimed subject matter. This rejection is traversed. 

Applicant alone invented the claimed subject matter. In Section I of this response, 
Applicant has demonstrated that the claimed subject matter is patentable over Heydon, 

IV. Claim Rejections: 35 USC § 102(e) 

Claims 1-4 and 6-14 are rejected under 35 USC § 102(e) as being anticipated by 
Eichstaedt et aL (USPN 6,182,085, hereinafter Eichstaedt). This rejection is traversed. 

A proper rejection of a claim under 35 U.S.C. §1 02(e) requires that a single prior 
art reference disclose each element of the claim. See MPEP § 2131, also, W.L. Gore & 
Assoc., Inc, v. Garlock, Inc., 721 F.2d 1540, 220 U.S.P.Q. 303, 313 (Fed. Cir. 1983). 
Since Eichstaedt neither teaches nor suggests each element in claims 1 -4 and 6-14, these 
claims are allowable over Eichstaedt. 

Claim 1 

Independent claim 1 recites numerous limitations that are not taught or suggested 

in Eichstaedt. For example, claim 1 recites "assigning a web crawler identifier to each 

one of the plurality of web crawlers." Eichstaedt does not teach or suggest this limitation. 

The Office Action cites Col. 10 and gatherer processor id "i" as teaching this limitation. 

This section of Eichstaedt teaches how to partition the web-graph among numerous 

gatherers or processors: 

Assuming that one version of the present invention has k 
"gatherers" or processors, the web-graph is divided into k 
sub-graphs W l? . . . W^. Each sub-graph is mapped to a 
processor (e.g., W : to processor i). (Col. 10, lines 18-21). 
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Thus, this section of Eichstaedt teaches how to divide the web-graph between 
processors. This section does not teach or suggest assigning a web crawler identifier to 
each processor or gatherer. 

As another example, claim 1 recites downloading data sets and identifying 
addresses of one or more referred data sets. For each identified address, claim 1 
specifically recites "'generating a representation of the host computer identifier" and 
"'determining a web crawler identifier to which the representation corresponds/* 
Eichstaedt does not teach this limitation. As noted, Eichstacdt does not assign web 
crawler identifiers to each gatherer or processor. As such, Eichstaedt does not generate a 
representation of a host computer identifier and then determine a web crawler identifier 
to which the representation corresponds. 

The Office Action relies on Fig, 6 and Col. 6 (for example, lines 39-67 and 2-38). 
These sections in Eichstaedt teach dividing the web-space (the URL space) into sub- 
spaces and assigning sub-spaces to certain processors (Col. 6, lines 30-32). When new 
URLs are added to the web-space, the processor processes URLs belonging to its sub- 
space and routes other URLs (i.e., those not belonging to its sub-space) to the proper 
processor (Col. 6, lines 33-38). Notice though, that Eichstaedt does not generate a 
representation of a host computer identifier and then determine a web crawler identifier 
to which the representation corresponds. 

Dependent claims 2-4 depend from claim 1 and thus inherit all the limitations of 
base claim L As such, claims 2-4 are also allowable over Eichstaedt. Further, these 
dependent claims contain numerous limitations not taught or suggested in Eichstaedt. 

Claim 6 

Independent claim 6 recites numerous limitations that are not taught or suggested 
in Eichstaedt. For example, claim 6 recites 'Svherein each web crawler has been assigned 
a web crawler identifier." For the reasons discussed above in connection with claim I, 
Eichstaedt does not teach or suggest this limitation. 

As another example, claim 6 recites a main web crawler module that identifies 
addresses of referred data sets, wherein each identified address includes a host computer 
identifier. An address distribution module processes the identified addresses and includes 
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instructions for "generating a representation of the host computer identifier" and 
"determining a web crawler identifier to which the representation corresponds/* For the 
reasons discussed above in connection with claim 1, Eichstaedt does not teach or suggest 
this limitation. 

Dependent claims 7-9 depend from claim 6 and thus inherit all the limitations of 
base claim 6. As such, claims 7-9 are also allowable over Eichstaedt. Further, these 
dependent claims contain numerous limitations not taught or suggested in Eichstaedt. 

Claim 10 

Independent claim 10 recites numerous limitations that are not taught or 
suggested in Eichstaedt. For example, claim 10 recites 'Vherein each web crawler has 
been assigned a web crawler identifier." For the reasons discussed above in connection * 
with claim 1, Eichstaedt does not teach or suggest this limitation. 

As another example, claim 10 recites a main web crawler module that identifies 
addresses of referred data sets, wherein each identified address includes a host computer 
identifier. An address distribution module processes the identified addresses and includes 
instructions for "generating a representation of the host computer identifier" and 
"determining a web crawler identifier to which the representation corresponds." For the 
reasons discussed above in connection with claim 1, Eichstaedt does not teach or suggest 
this limitation. 

Dependent claims 1 1-14 depend from claim 10 and thus inherit all the limitations 
of base claim 10. As such, claims 11-14 arc also allowable over Eichstaedt. Further, these 
dependent claims contain numerous limitations not taught or suggested in Eichstaedt. 

V. Claim Rejections: 35 USC § 103 

Claims 15-18 are rejected under 35 USC § 103 as being unpatentable over 
Eichstaedt in view of Najork. Applicant respectfully traverses. 

To establish a prima facie case of obviousness, three basic criteria must be met. 
First, there must be some suggestion or motivation, either in the references themselves or 
in the knowledge generally available to one of ordinary skill in the art to modify the 
reference or to combine reference teachings. Second, there must be a reasonable 
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expectation of success. Finally, the prior art cited must teach or suggest all the claim 
limitations. See MJP.E.P. § 2143. Applicant asserts that ihe rejection does not satisfy 
these criteria. 

As one example, section II above discusses numerous claim elements that are not 
taught or suggested in Najork, and section TV above discusses numerous claim elements 
that are not taught or suggested in Eichstaedt. The combination of Najork and Eichstaedt 
does not cure the noted deficiencies, in other words, for at least the reasons noted in 
sections II and IV, the combination of Najork and Eichstaedt does not teach all the claim 
limitations. 

CONCLUSION 

In view of the above, Applicant believes all pending claims are in condition for 
allowance. Allowance of these claims is respectfully requested 

Any inquiry regarding this Amendment and Response should be directed to Philip 
S. Lyren at Telephone No. (281) 514-8236, Facsimile No. (281) 514-8332. In addition, 
all correspondence should continue to be directed to the following address: 



Hewlett-Packard Company 

Intellectual Property Administration 
P.O. Box 272400 

Fort Collins, Colorado 80527-2400 



Respectfully submitted, 




Philip S. Lyren 
Reg. No. 40,709 
Ph: 281-514-8236 



CERTIFICATE UNDER 37 C.F.R. 1.8 
The undersigned hereby certifies that this paper or papers, as described herein, is being transmitted to the United States 
Patent and Trademark Office facsimile number 703-872-9306 on this _/#?^day of March, 2005. 
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